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■ Abstract 
> ■ 

Consider the following stochastic block model of a random graph 
consisting of two clusters of size approximately n/2 . The cross-class 
edge probability is a/n and the within-class probability is b/n. Decelle 
. et al. conjectured a threshold for the algorithmic problem of recon- 

^ ■ structing the hidden labels in a way that is correlated with the true 

0^ . partition. Their conjecture is that the threshold is (a - b)^ = 2{a + b) 

■ which is exactly the threshold for the corresponding reconstruction 

problem on trees. 

We prove one side of this conjecture, i.e., that reconstruction is 
. impossible when (a - b)^ < 2{a + b). Moreover, we show that the 

■ stochastic block model is contiguous to an Erdos-Renyi model when 

(a - 6)^ < 2(a + b) and orthogonal to it when (a - 6)^ > 2(a + b). 

^ . 1 Introduction 

.5^- 1.1 Stochastic Block Models 

The study of random networks has seen a surge of interest, driven partly by 
social and biological applications. Random networks that exhibit a "com- 
munity" or "cluster" structure are of particular interest and many such 
models are now being studied; a discussion of similar models can be found 
in [20] . In these models, a collection of vertices are divided into several 
classes and then a random graph is drawn in some way that depends on the 
class membership. In many models, such random graphs tend to feature 



^Supported by NSF grant DMS-1106999 and DOD ONR grant N000141110140 



1 



many connections between vertices of the same class and fewer connections 
between vertices of different classes. 

Perhaps the simplest random graph model with a community structure 
is the stochastic block model, in which vertices are assigned independent 
random labels and then the edges are chosen independently, with the proba- 
bility of an edge appearing depending only on the label of its endpoints. The 
stochastic block model and its relatives have been studied since the 1980's, 
with literature extending across physics, statistics and computer science. It 
was introduced by Holland et al. [16], while a similar model was proposed 
independently by Bui et al. [5]. 

The simplest of these models may be defined by first assigning each 
vertex to one of the two clusters independently and uniformly. Then each 
potential edge {u,v) of the graph is included independently with probability 
a/n if u,v belong to the same cluster and b/n if they belong to different 
clusters. It is this model that we will discuss in the current paper. 

In physics and statistics, such models were motivated by physical, biolog- 
ical, and social systems - see |13j for a survey discussing these motivations. 
In computer science, on the other hand, the study of such models was driven 
by the study of the graph bisection problem, which is computationally hard 
in the worst case [iAi, but is easy on average under the stochastic block 
model with suitable parameters. We should note that the stochastic block 
model is usually called the "planted partition model" in the computer sci- 
ence literature. 

1.2 Block Reconstruction 

In all areas, the most prominent question is that of "community reconstruc- 
tion." Given a random graph with the vertex labels erased, is it possible to 
reconstruct the vertex labels just by looking at the graph structure? This 
question has been studied many [Illll[6l[8l[IIl[I71[IS[ZIl[25l[26] times, with 
many different methods. For example the results of [Tl] (following similar 
results for a slightly different model) imply that if a - 6 > n^''^"'"^ then the true 
partition can be found with high probability. The results of [21] imply that 
the true partition can be found with probability 1-6 if {a-b)'^/a > Clog{n/6) 
where C is a large constant (see [Il|25] for related results). 

Each of the above works contains an algorithm that will exactly re- 
construct the true vertex labels with high probability under certain model 
parameters. 

When the goal is perfect recovery of all the labels, the community recon- 
struction question only makes sense for random graphs with degrees tending 
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to infinity. A random sparse graph on n nodes with average degree less than 
some smah constant times logn will contain many isolated vertices whose 
labels clearly cannot be inferred. Even when looking at the connected com- 
ponent it is easy to see that many vertices with small degrees will be inferred 
incorrectly when a,b> 0. 

1.3 Sparse Block Models and Our Contribution 

For sparse graphs it is natural to propose a relaxed problem in which one only 
seeks to find a labelling that is positively correlated with the true labelling. 
This problem was recently studied by Coja-Oghlan [7] who proved that this 
can be done for suitable choices of the model parameters. In particular, in 
the setting of the current paper this can be done when (a - b)^ > C{a + b) 
for some large constant C. 

More recently, Decelle et al. [9l[T0] conjectured, based on deep but non- 
rigorous ideas from statistical physics, a threshold that separates the pa- 
rameters for which reconstruction is possible from the parameters for which 
it is not. Their conjecture is that it is possible to reconstruct a correlated 
partition if > 2(a + 6), but that it is not possible if (a-b)^ < 2{a + b). 

We will rigorously prove some of the ideas from their paper. 

The conjecture of l9j relates the block reconstruction problem to the 
tree reconstruction problem, see e.g |22) . Consider the following multi-type 
branching process where there are two types of particles named ±. Each 
particle gives birth to Poisson with parameter a particles of the same type 
and a Poisson with parameter b particles of the complementary type. In the 
reconstruction problem, the goal is to recover the label of the root of the 
tree from the labels of level r where r oo. 

Results of Kesten and Stigum for multi-type branching processes [19] 
imply that if (a - b)^ > 2{a + b) then it is possible to recover the root value 
with non-trivial probability and results of Evans et al. imply that if < 
2(a + 6) then it is impossible to recover the root with non-trivial probability. 

It is not hard to imagine that there is a connection between the two 
problems. If we look at a neighborhood of a vertex in the random graph 
model, it is a tree with high probability and the tree distribution is given in 
the limit by the multi-type branching process defined above. 

As part of this work, we make this connection rigorous. By doing so, 
we prove one side of [Qj's threshold conjecture: we show that it is impos- 
sible to recover the labels below the threshold - when (a - b)^ < 2(a + b). 
Then, we show that the conjectured threshold is a threshold for a different 
question: below the threshold, a graph from the stochastic block model is 
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almost indistinguishable from an Erdos-Renyi random graph, but above the 
threshold they can be distinguished with probability tending to 1. 

2 The model and the main results 

The stochastic block model Q(n,p,q) is a model for random, {±}-labelled 
graphs on n nodes - more generally, there can be more than two labels but 
we will only consider the two-label case here. It is easiest to describe this 
model by saying how to sample from it: to generate a pair (G, a) ~ Q(n,p, q), 
we first choose a uniformly random labelling a e {±}". Then for every pair 
{u,v} independently, we draw an edge between u and v with probability p 
if au - CTy and with probability q otherwise. Of course, when p - q then the 
model Q{n,p,q) is the same as the Erdos-Renyi model Q{n,p). 

Equivalently, we can specify Q{n,p,q) by writing down its probability 
mass function: let 

if Gu - CTy and {u, v} e E{G) 
if cJu t (jy and {u,v} e E(G) 
if cJu = ay and {u,v} ^ E{G) 
if au * ay and {u^v} ^ E{G). 

Then 

P(G,c7) = 2-" n K.(G,c7). 

In this article, we will be focused on the sparse case, where both p 
and q are 0(l/n). Therefore, let us introduce two new parameters a,b > 
and set p - ^, q - ^. From now on, P„ will denote the distribution 
g{n,^,^). Although is a joint distribution on both graphs and labels, 
we will sometimes write G ~ P„ when we really mean that G is drawn from 
the marginal distribution. 

Suppose that a+b > 2, so that G ~ P^ has a giant component. Then there 
is some hope of reconstructing, from the unlabelled graph, a labelling r that 
is correlated with the true labels a in the sense that P(tu = Ty\au - ay) > ^ 
for any fixed pair u,v of vertices. Decelle et al. conjectured that label 
reconstruction is possible if (a - 6)^ > 2(a + b) and impossible if (a - b)^ < 
2(a + b). Our first result is a partial answer to their conjecture; specifically, 
we show that reconstruction is impossible when (a - b)^ < 2(a + b). Note 
that our result includes the case (a - b)^ - 2{a + b), for which Decelle et al. 
did not conjecture any particular behavior. 
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Theorem 2.1. If a + b> 2 and (a-b)^ < 2{a + b) then, for any fixed vertices 
u and V, 

In particular, this shows that even an easier problem cannot be solved: 
if we take two random vertices of G, no algorithm can tell whether or not 
they have the same label. This is an easier task than label recovery because 
we no longer ask the algorithm to label all the vertices; we only ask it to 
say whether two of them have the same label or not. 

As for the other half of the conjecture, Coja-Oghlan [7] showed that 
reconstruction is possible provided that (a -6)^ > C{a + b) for some unspeci- 
fied constant C. What remains open, therefore, is to show that one can take 
C = 2. 

The proof of Theorem 12.11 follows from a connection with Markov pro- 
cesses on trees. It is well-known that a small neighborhood in a sparse 
Erdos-Renyi graph looks like a Galton- Watson tree. Not surprisingly, this 
is still true in Q(n, -, -). Moreover, we can show that the labels in a small 
neighborhood look as though they were obtained by running a Markov pro- 
cess on the tree. After making this connection suitably precise, we can apply 
the Kesten-Stigum reconstruction threshold, which says that the labels at 
the leaves of the tree don't tell us anything about the label of the root. From 
there, it doesn't require a great leap of faith to believe that if the boundary 
of some neighborhood centered at u gives us no information about au, then 
the label of some far-off vertex v won't tell us anything either. 

Our second result gives a threshold for a different question. So far we 
have been only considering P„ = g{n, ^, ^). Now let = g{n, ^) be the 
Erdos-Renyi model that has the same average degree as P„. If we were to 
give you a graph G which was drawn from either P„ or P^, would you be 
able to tell which one it came from? It turns out that the answer is "yes" 
when (a - b)^ > 2(a + b) and "sometimes" when (a - b)^ < 2(a + b). The case 
(a - b)^ - 2{a + b) remains open. 

Theorem 2.2. // (a - 6)^ > 2(a + 6) then P^ and P,^ are asymptotically 
orthogonal. In other words, there exist events An such that P„(A„) 1 and 

KiAn) - 0. 

If {a - b)^ < 2{a + b) then Pn and ¥'„ are mutually contiguous i.e., for a 
sequence of events An, P„(^n) if, and only i/, P^(^„) 0. 

We should emphasize that P„ and P^ do not converge to one another, 
even below the threshold. In fact, as long as a t b, one can tell with prob- 
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ability strictly bigger than whether a given graph came from P„ or P^. 
However, the probability of success cannot converge to 1 if (a-6)2 < 2{a + h). 

The two parts of Theorem 12.21 require two separate proofs. The first 
part is quite straightforward: we show that you can tell which model a 
graph comes from just by counting the number of short cycles that it has. 
It's well-known that the number of /c-cycles in P,^ is approximately Poisson- 
distributed with mean \{^^)^ ■ We will show that the number of A;-cycles 
in P„ is approximately Poisson-distributed with mean ^((^)^ + (^)'^)- 
It's then not hard to see that by taking k to increase slowly with n, we can 
distinguish between the corresponding Possion random variables as long as 
(a -5)2 > 2{a + h). 

For the second part of Theorem 12.21 we will show that the random vari- 

IP (G^ 

ables pT^^jj don't have much mass near or oo. Since the margin of P^ is 
somewhat complicated to work with, the first step is to enrich the distribu- 
tion P^ by adding random labels. Then we show that the random variables 
p"(g'ct) do^'t have mass near or oo. This essentially involves estimat- 
ing a partition function, for which we use the small subgraph conditioning 
method. 

We briefly note that Theorem 12.21 has implications for parameter esti- 
mation. 

Proposition 2.3. Consider the problem of inferring the parameters a,b 
from a single sample. Then there exists a consistent estimator for the pa- 
rameters a and b from a single sample if (a - b)^ > 2(a + b). There is no 
consistent estimator for the parameters a, b if {a - b)^ < 2(a + b) 

Indeed, our cycle-counting results provide an estimator for a and b which 
is consistent when (a - 6)^ > 2(o + b) On the other hand, we will show that 
the second half of Theorem 12.21 implies that no estimator can be consistent 
when (a - b)"^ < 2(a + b). 

3 Counting cycles 

The main result of this section is that the number of /c-cycles of G ~ P^^ is 
approximately Poisson-distributed. We will then use this fact to show the 
first part of Theorem 1 2. 21 Actually, Theorem [22] only requires us to calculate 
the first two moments of the number of fe-cycles, but the rest of the moments 
require essentially no extra work, so we include them for completeness. 
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Theorem 3.1. Let Xk^n be the number of k-cycles of G, where G ~ Pn- If 
k = 0(log^/^(n)) then 

^M-Pois(^((a + 6)^ + (a-6)^)). 

Before we prove this, let us explain how it implies the first part of The- 
orem [221 From now on, we will write Xj^^ instead, of Xj^^ ^. 

Proof of the first part of Theorem \2.B . Let's recall the standard fact (which 
we have mentioned before) that under P^, X^ Pois^^^pj-)- With this 
and Theorem 13.11 in mind, 

(a + b)'' + (a-b)'' 
EpXfc,VarpXfc^^ ^ ^ ^ 



Set k - k(n) - log^^^n (although any sufficiently slowly increasing func- 
tion of n would do). Choose p such that ^ > P > \f^^- Then VarpX^ 
and Varp'Xfc are both o{p^^) as k ^ oo. By Chebyshev's inequality, Xf^ < 
Ep'Xfc + p'^ P'-a.a.s. and X^ > EpX^ - p'' P-a.a.s. Since EpX^ - Ep/X^ = 
^(^)'' = w(p'''), it follows that EpX^ - p^ > Ep/X^ + p'' for large enough 
k. And so, if we set An - {-^fc(n) ^ lEp'Xfc(„) + p^} then F'{An) 1 and 
U) 0. □ 



Now we will prove Theorem 13.11 using the method of moments. Recall, 
therefore, that if y ~ Pois(A) then EYj-^j = A™", where Y[m] denotes the 
falling factorial Y (Y - !)■ ■ -(Y - m + 1) . It will therefore be our goal to show 

that E,{Xk)[m] ^ ( ^"^^fc2*^+° ) • turns out that this follows almost 
entirely from the corresponding proof for the Erdos-Renyi model. The only 
additional work we need to do is in the case m-1. 

Lemma 3.2. If k - o{\/n) then 

Proof. Let t>o, • • . , "Wa;-! be distinct vertices. Let y be the indicator that 
VQ...Vk-i is a cycle in G. Then EpX^ = (fc) *'^2^^' ^P^' compute 
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KpY . Define to be the number of times in tfie cycle vi . . .Vk that a^. t Cvi+i 
(with addition taken modulo k). Then 



k-mim 



m=0 m=0 

On the other hand, we can easily compute P(N - m): for each i - 0,. . . , k-2, 
there is probability ^ to have dt,. = cr^i+i, and these events are mutually 
indepedent. But whether ct„j. ^ = is completely determined by the other 
events since there must be an even number of i e {0, . . . , A; - 1} such that 
CTv, * cr^.^i- Thus, 

P(A^ = m) = Pr( Binom (fc - 1, e {m - 1, m}) 

\\ni-l/ \ m If \ml 

for even m, and zero otherwise. Hence, 

m even \Tn/ 

The second part of the claim amounts to saying that nj-^,] ~ n^, which is 
trivial when k - o{\/n). □ 

Proof of TheoremlEJi Let /i = ^((a + 6)'= + (a-6)'=); our goal, as discussed 
before Lemma 13.21 is to show that E(Xfc)[-„] /i™. Note that (X^)]-^] is 
the number of ordered m-tuples of fc-cycles in G. We will divide these m- 
tuples into two sets: A is the set of m-tuples for which all of the fc-cycles are 
disjoint, while B is the set of m-tuples in which at least one pair of cycles 
is not disjoint. 

Now, take (Ci,...,Cm) e A. Since the Ci are disjoint, they appear 
independently. By the proof of Lemma 13.21 the probability that cycles 
Ci , . . . , Cm are all present is 



^~kmr,-km 



Since there are (^'^) elements of A, it follows that the expected number 
of vertex-disjoint m-tuples of A;-cycles is 

/ n \ {km)\ -kmc^-km ( ( , h\k , r m 
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It remains to show, therefore, that the expected number of non-vertex- 
disjoint m-tuples converges to zero. Let Y be the number of non-vertex- 
disjoint m-tuples, 

m 

E ni{c,cG}. 

(Ci,...,C„)£Bi=l 

Then the distribution of Y under P is stochasticahy dominated by the dis- 
tribution of Y under the Erdos-Renyi model Q{n, ) . It's weU-known 
(see, eg. [3|, Chapter 4) that as long as A; = 0(log^''^n), -> under 
Q{n, ^) for any c; hence WY under P also. □ 



4 Finding a density 

In this section, we will prove the second part of Theorem 12.21 The general 
direction of this proof was already described in the introduction, but let's 
begin here with a slightly more detailed overview. Recall that P^ denotes 
the Erdos-Renyi model Q{n, ^). The first thing we will do is to extend P' 
to be a distribution on labelled graphs. In order to do this, we only need to 
describe the conditional distribution of the label given the graph. We will 
take 

where is the normalization constant for which this is a probability. 

Now, our goal is to show that pr^^'^j is well-behaved; with our definition of 
W'^{a\G), we have 

Fn(G,a) ^ ^n{cj)Zn{G) ^ 

Thus, the second part of Theorem 12.21 reduces to the study of the partition 
function Zn{G). To do this, we will use the small subgraph conditioning 
method. This method was developed by Robinson and Wormald [231124) in 
order to prove that most d-regular graphs are Hamiltonian, but it has since 
been applied in many different settings (see the survey [27] for a more de- 
tailed discussion). Essentially, the method is useful for studying a sequence 
Yn{Gn) of random variables which are not concentrated around their means, 
but which become concentrated when we condition on the number of short 
cycles that Gn has. Fortunately for us, this method has been developed 
into an easily applicable tool, the application of which only requires the 
calculation of some joint moments. The formulation below comes from [27] . 
Theorem 4.1. 
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Theorem 4.1. Fix two sequences of probability distributions and P„ on a 
common sequence of discrete measure spaces, and letYn-^ be the density 
o/P„ with respect to P„. Let > and 5^ > -1 be real numbers. For each 
n, suppose that there are random variables - ^^(n) e N for k > 3 such 
that 

(a) For each fixed m> 1, {Xk{n)}^^^ converge jointly under P^ to indepen- 
dent Poisson variables with means A^; 

(b) For every ji, . . . , e N, 
(c) 



^ Afc<5fc < oo; 
fc>3 



(d) 



Then P^ and P„ are contiguous. 



exp [Y^Xk^l 

\fe>3 



In our application of Theorem 14. 1 1 the discussion at the beginning of this 
section imphes that y„ = Yn{G) - 2""pf^^. We will take Xk{n) to be the 
number of fc-cycles in G„. Thus, condition (a) in Theorem 14.11 is already 

well-known, with A^ = 2l(^)^' This leaves us with three conditions to 
check. We will start with (d), but before we do so, let us fix some notation. 

Let a and r be two labellings in {±}". Take S - S{a) - {u ■ - +} 
and T - T{t) - {u : - +}. It is common practice to write \S\ for the 
cardinality of S, but we will often omit the | • | symbol, especially where S 
appears in an exponent. We will also omit the subscript n in P„ and P^, and 
when we write n(«,t;)) we mean that u and v range over all unordered pairs 

of distinct vertices u,v & G. Let t (for "threshold") be defined hy t - . 

For the rest of this section, G ~ P'. Therefore we will drop the P' from 
Ep' and just write E. 
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4.1 The first two moments of ¥„ 



Since - ^l^^j^, Ey„ = 1 trivially. Let's do a short computation to double- 
check it, though, because it will be useful later. Define 



2a 
a+b 
2b 

a+b 



if {u,v)€EnS 
if {u,v) e ^n5" 



and define Vuv by the same formula, but with a and S replaced by r and T. 
Then 

Yn = 2-" ^ n ^™ 

and 



i-2n 



E n 



(7,Te{±}" 

Since {Wuv}(u,v) are independent given a, it follows that 

CTe{±}" (m,d) 



and 



(7,Te{±}" ("ii,!") 



Thus, to compute El^, we should compute KWuv, while computing 
involves computing KWuvVuv 

Lemma 4.2. For any fixed a , 

ET^„„(G,a) = 1. 
Proo/. Suppose e S. Then P'((u,t;) e = go 



2a a + 6 



n - a 



a + b 2n n- {a + b)/2 
The case for {u,v) e S"^ is similar. 



/ a + b\ a 
\ 2n / n 



+ 1-- = 1. 

n 



□ 



Nonwithstanding the fact that computing EY^ is trivial anyway, Lemma [4.2l 
and ([1]) together imply that Ey„ = 1. Let us now move on to the second 
moment. 
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Lemma 4.3. // {u,v) e S nT or (n, v) & n then 

n 2{a + b) An^ 



Otherwise, 



n 2{a + b) Av? 
Proof. Suppose {u,v) e S nT. Then 

/ 2a \^ a + b ( n-a \^ / a + b\ 
\a + hl 2n \n- {a + b)/2 J \ 2n I 

2a' {l-^f 



n(a + b) 1-^ 



2a^ / o,\' I a + b {a + b) 



2 



n{a + b) \ n) \ 2n An"^ 

, 1 {a-bf {a-bf ^. _3, 

= 1 + - ■ '— + + 0(n ■^). 

n 2{a + b) An^ ^ ' 

The computation for (n,u) e 5^ n T"^ is analogous. 

Now assume {u,v) e S" n T^. By a very similar computation, 

(a + 6)2 2n {l-^y V 2n / 

n 2(a + 6) 4n2 ^ ^ 

The computation for {u,v) € S'^ nT is analogous. □ 

Given what we said just before Lemma 14.21 we can now compute 
just by looking at the sizes of SnT, S"^nT and so on. To make this easier, we 
introduce another parameter, p - p{a,T) - ^ 'EiO'iTi. Then some elementary 
counting gives 

2 

\S nT\ + \S' nT'\ ^ {1 + p')— -- 
I II ' ^ M 2 

|5nr'=| + |5"nr| = (l-p'^) — . 
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Lemma 4.4. 

-t/2-i2/4 

Ey„2. (1 + 0(1))^ 



Before we proceed to the proof, recall (or check, by writing out the Taylor 
series of the logarithm) that 



as n ^ oo. 

Proof. Define 7n = ^ + ^4"''] ; note that 



(1 + jrrf = (1 + o(l)) exp ( + tn - I 

(1 - 7„)"^ = (1 + 0(1)) exp I -^^^ - in - ^ 



4 

(l + 7nr = (l + o(l))exp(t). 



Then, by Lemma 14.31 



= (l.„(l))e-'/-""Eexp(((<i^.t„)). 

Computing the last term would be easy if p\/n were normally distributed. 
Instead, it is binomially distributed, which - unsurprisingly - is just as good. 
To show it, though, will require a slight digression. 

Lemma 4.5. If e {±} are taken uniformly and independently at random 
and ^ EiLi then 

Eexp{sZl/2) ^ 



whenever s < 1. 
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Proof. Since z i-s- ex.p{sz^ /2) is a continuous function, the central limit the- 
orem implies that exp(sZ^/2) 4- exp(sZ^/2), where Z ~ ^''(O,!). Now, 
Eexp(sZ^/2) = and so the proof is complete if we can show that the 

sequence exp(sZ^/2) is uniformly integrable. But this follows from Hoeffd- 
ing's inequality: 

Pr(exp(.Z^/2) > M) = Pr > ^^^j < M-l/^ 

which is integrable near oo (uniformly in n) whenever s < 1. □ 

To finish the proof of Lemma I4.4t take Z„ as in Lemma 14.51 and note 
that 

4.2 Dependence on the number of short cycles 

Our next task is to check condition (b) in Theorem l4.1l Note, therefore, that 
[X3]j3---[Xm]j„ is the number of ways to have an ordered tuple containing 
jd 3-cycles of G, ja 4-cycles of G, and so on. Therefore, if we can compute 
WYn^H where 1h indicates that any particular union of cycles occurs in Gn, 
then we can compute Ey„[X3]m3---[^m]jm- Computing 'EYn^u is the main 
task of this section; we will do it in three steps. First, we will get a general 
formula for EY^l/f in terms of H. We will apply this general formula in the 
case that H is a single cycle and get a much simpler formula back. Finally, 
we will extend this to the case when H is a union of vertex-disjoint cycles. 

As promised, we begin the program with a general formula for El//1^. 
Let H he a graph on some subset of [n], with = m. With some 

slight abuse of notation. We write 1h for the random variable that is 1 
when H cG, and F'{H) for the probability that H cG. 

Lemma 4.6. 

o-e{±1}™ {u,v)eE(H) 

where 

fj^ tf{u,v)^Sia) 
I ^ otherwise. 
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Proof. We break up o" e {±1}" into ((Ti,cr2) e 

and sum over the two parts separately. Note that if {u,v) e E(H) then 
WuviG,a) depends on a only through ai. Let D{H) - E{G) \ E{H), so 
that (ujf) e D{H) implies that Wuv and 1/;- are independent. Then 

=2-"^((ei^ n n ^Wu^ 

o-i \ {u,v)eE{H) («,«)6D(//) / 

ci (u,v)eE{H) 

because if (u, v) e D{H) then, for every o", Lemma r4.2l savs that ¥.Wuv{G, a) - 
1. To complete the proof, note that if {u,v) e E{H) then for any u, 
Wu^,(G,(t) e Wuv{(^) on the event H <^G. □ 

The next step is to compute the right hand side of Lemma [4.6l in the case 
that H is a, cycle. This computation is very similar to the one in Lemma [3. 21 
when we computed the expected number of /c-cycles in ^(n, ^,^). Essen- 
tially, we want to compute the expected "weight" of a cycle, where the 
weight of each edge depends only on whether its endpoints have the same 
label or not. 

Lemma 4.7. If H is a k-cycle then 

a£{±l}H {u,v)<iE{H) \ \a + 0/ I 

Proof. Let ei, . . . , be the edges of H. Provided that we renormalize, we 
can replace the sum over a by an expectation, where a is taken uniformly 
in {±1}^. Now, let be the number of edges of H whose endpoints have 
different labels. As discussed in the proof of Lemma 13.21 Pr(A^ = j) = 
2-A;+i^fc^ for even j, and zero otherwise. Then 

IE. n wU<y)-M^^ 

\a + b/ 



15 



Extending this calculation to vertex-disjoint unions of cycles is quite 
easy: suppose H is the union of cycles Hi. Since Wuv{cr) only depends on o"u 
and cj„, we can just split up the sum over a e {±}^ into a product of sums, 
where each sum ranges over {±}^\ Then applying Lemma 14.71 to each Hi 
yields a formula for H. 

Lemma 4.8. Define 




If H -\JiHi is a vertex-disjoint union of graphs and each Hi is a ki-cycle, 
then 

n ^™(^,cT)=2i^in(i+4o. 

o-e{±l}» {u,v)eE{H) i 

We we need one last ingredient, which we hinted at earlier, before we can 
show condition (b) of Theorem 14.11 We only know how to exactly compute 
EYnl/f when H is a disjoint union of cycles. Now, most tuples of cycles 
are disjoint, but in order to dismiss the contributions from the non-disjoint 
unions, we need some bound on EY^lj^ that holds for all H: 

Lemma 4.9. For any H , 

o-e{±l}» (u,v)^E{H) 

Proof. 

, . 2max{a,6| 

wuv{(y) < < 2 

a + b 

for any i,j,H and a. □ 

Finally, we are ready to put these ingredients together and prove condi- 
tion (b) of Theorem 14. 1[ For the rest of the section, take 6k - (f^f)*' as it 

was in Lemma 14.81 Also, recall that = j^i^^)^ limit of EX^ as 

n oo. 

Lemma 4.10. Let be the number of k-cycles in G. For any j^, . . . ,jm e 

m m 
k=3 fc=3 
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Proof. Set M - Y,k krrik- First of all, 

Hi,...,Hj i 

where the sum ranges over all j-tuples of distinct A:-cycles, and 1h indicates 
the event that the subgraph H appears in G. Thus, 

m m jk 

where the sum ranges over all M-tuples of cycles {Hki)k<m,i<jt, for which 
each Hki is an fc-cycle, and every cycle is distinct. Let H. be the set of 
such tuples; let ^ c be the set of such tuples for which the cycles are 
vertex-disjoint, and let B = Ti \ A. Thus, if H - UH/^i for {Hki) e A, then 

^YniH-Ylii + ^kY'^'in) 

k 

by Lemmas 14.61 and 14.81 Note also that standard counting arguments (see, 
for example, 0, Chapter 4) imply that \A\¥'{H) ^ IlfcA^^ 

On the other hand, if (-fffci) ^ B then H ■- [Jki^ki has at most M - 1 
vertices, M edges, and its number of edges is strictly larger than its number 
of vertices. Thus, P'(i7)(|^|) 0, so Lemmas 14.61 and 14.91 implv that 

Y, EYnlH<^'{H)\H\l(^'']2^' ^0, 

H'~H M^K 

where the sum ranges over all ways to make an isomorphic copy of H on n 
vertices. Since there are only a bounded number of isomorphism classes in 

{\jHki:{Hki)€B}, 

ki 

it follows that ^.H^^n^H 0, where the sum ranges over all unions of 
non-disjoint tuples in H. Thus, 

k=3 \(Hk^)eA (Hk^)iB J 

= |A|P'(i7)n(l+4F^+o(l) 

k 

^Yli^kii + Sk))'". □ 

k 
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To complete the proof of Theorem 12.21 ^ote that S^Xk - Thus, 
Zk>3^k^k = 5(log(l-t)-t-t^/2). When t < 1, this (with Lemma|13I) proves 
conditions (c) and (d) of Theorem 14. 1[ Since condition (a) is classical and 
condition (b) is given by Lemma [4. 101 the conclusion of Theorem 14.11 implies 
the second part of Theorem 12.21 

5 Non-reconstruction 

The goal of this section is to prove Theorem 12.11 As we said in the intro- 
duction, the proof of Theorem 1 2 . 1 1 uses a connection between G{n, ^, and 
Markov processes on trees. Before we go any further, therefore, we should 
define a Markov process on a tree and state the result that we will use. 

Let T be an infinite rooted tree with root p. Given a number < e < 1, 
we will define a random labelling r e {±}"^- First, we draw Tp uniformly in 
{±}. Then, conditionally independently given Tp, we take every child u of 
p and set = Tp with probability 1 - e and Tu - -Tp otherwise. We can 
continue this construction recursively to obtain a labelling r for which every 
vertex, independently, has probability 1 - e of having the same label as its 
parent. 

Back in 1966, Kesten and Stigum [19] asked (although they used some- 
what different terminology) whether the label of p could be deduced from 
the labels of vertices at level R of the tree (where R is very large). There are 
many equivalent ways of stating the question. The interested reader should 
see the survey [22], because we will only mention two of them. 

Let Tr = {u&T: d{u, p) < R} and define OTr = {u€T: d{u, p)^R]. We 
will write tt^ for the configuration r restricted to Tr. 

Theorem 5.1. Suppose T is a Galton-Watson tree where the offspring dis- 
tribution has mean d> 1. Then 

lim Pr{Tp = +\TdTn) = \ a.s. 

if, and only if d{l - 2e)^ < 1. 

In particular, if d(l - 2e)^ < 1 then tqtr contains no information about 
Tp. Theorem 15.11 was established by several authors over the course of more 
than 30 years. The non-reconstruction regime (ie. the case d{l - 2e)^ < 1) 
is the harder one, and that part of Theorem 15.11 was first proved for d-ary 
trees in [2], and for Galton-Watson trees in [12]. This latter work actually 
proves the result for more general trees in terms of their branching number. 
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We will be interested in trees T whose offspring distribution is Pois(^^) 
and we will take 1-e - . Some simple arithmetic applied to Theorem 15.11 
then shows that reconstruction of the root's label is impossible whenever 
(a-b)^ < 2{a + b). Not coincidentally, this is the same threshold that appears 
in Theorem 12.11 

5.1 A neighborhood of G looks hke T 

The first step in applying Theorem 15.11 to our problem is to observe that a 
neighborhood of {G,a) ~ G{n, ^, ^) looks like {T,t). Indeed, fix p e G and 
let Gr be the induced subgraph on {u e G : d{u,p) < R}. 

Proposition 5.2. Let R - R{n) - [ ioiog(2(a+fe)) ^og^J- There exists a cou- 
pling between {G,a) and {T,t) such that {GrtCTGr) - {T]i,TTjf^) a.a.s. 

For the rest of this section, we will take R - [ giog(2(a+b)) ^OS^J- 

The proof of this lemma essentially follows from the fact that (T, r) 
can be constructed from a sequence of independent Poisson variables, while 
{Gji,aGji) can be constructed from a sequence of binomial variables, with 
approximately the same means. 

For a vertex v € T, let be the number of children of v; let be 
the number of children whose label is t„ and let Y* - Yy - Y^ . By Poisson 
thinning, Y^ ~ Pois(a/2), Y* ~ Pois(6/2) and they are independent. Note 
that (T, r) can be entirely reconstructed from the label of the root and the 
two sequences (Yf), (Y*). 

We can almost do the same thing for Gn, but it is a little more compli- 
cated. We will write V = V{G) and Vr = V{G) \ V{Gr). For every subset 
W cV, denote by W* and the subsets of W that have the correspond- 
ing label. For example, - {v € Vr = +}. For a vertex v e OGr, 
let Xy be the number of neighbors that v has in Vr', then let Xy be the 
number of those neighbors whose label is and set X* - Xy - X^. Then 
Xy ~ Binom(|l^'^"|, a), X* ~ Binom(|V^~'^"|,6) and they are independent. 
Note, however, that they do not contain enough information to reconstruct 
Gr: it's possible to have u,v € dGr which share a child in Vr, but this can- 
not be determined from Xu and Xy. Fortunately, such events are very rare 
and so we can exclude them. In fact, this process of carefully excluding bad 
events is all that needs to be done to prove Proposition 15.21 

In order that we can exclude their complements, let us give names to 
all of our good events. For any r, let Ar be the event that no vertex in Vr 
has more than one neighbor in Gr- Let Br be the event that there are no 
edges within dGr- Clearly, if Ar and Br hold for all r = 1,. . . ,R then Gr 
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is a tree. In fact, it's easy to see that Ar and Br are the only events that 
prevent {X^ ,X*}y^G from determining {Gr^ugji)- 

Lemma 5.3. // 

1. {Tr-l,TT^_j) = {Gr-l,(JGr-i); 

2. = and X* = Y* for every u € dGr-i; and 

3. Ar and Br hold 
then iTr,TT^) = (Gr,(TGj. 

Proof. The proof is essentiahy obvious from the construction of X„ and 
Ya, but we will be pedantic about it anyway. The statement (T^^i, r7;,_i ) = 
(Gr-idGr-i) means that there is some graph homomorphism (p : Gr-i T^^i 
such that Uu = t^(u)- If s dGr-i and X^ = Y^^^-^ and X* - Y*^^-^ then we can 
extend 4> to Gr~iuM{u) while preserving the fact that - t^(v) ^- 
the event Ar, this extension can be made simultaneously for all u e dGr-i, 
while the event Br ensures that this extension remains a homomorphism. 
Thus, we have constructed a label-preserving homomorphism from {Gr^crCr) 
to {Tj.,TTr), which is the same as saying that these two labelled graphs are 
equal. 

From now on, we will not mention homomorphisms; we will just identify 
u with (p(u). □ 

In order to complete our coupling, we need to identify one more kind of 
good event. Let Cr be the event 

Gr = {\dGs\ < 2'{a + by logn for all s < r + 1}. 

The events Gr are useful because they guarantee that Vr is large enough 
for the desired binomial-Poisson approximation to hold. The utility of Gr is 
demonstrated by the next two lemmas. 

Lemma 5.4. 

F{Cr\Cr-i,a)>l-n-^°^^^/^\ 
Moreover, \Gr \ = 0{n}l^) on Gr-i- 
Lemma 5.5. For any r, 

¥{Ar\Gr-ua) > l-0(n"2/^) 
¥{Br\Gr-i,a) >l-0{n^l*). 
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Proof of Lemma \5.4\ First of all, is stochastically dominated by Binom(n, 
for any v. On C^-i, \dGr\ < 2^(a + byiogn and so |(9Gr+i| is stochastically 
dominated by 

Z ~ Binom (2^(a + bYnlogn, ). 



n 

Thus, 



EZ 



F{-~Cr\Cr-l,(j) < F{Z > 2EZ) < ^0 

by a multiplicative version of Chernoff 's inequality. But 

EZ = 2''(a + by+^ logn > logn, 

which proves the first part of the lemma. 
For the second part, on Cj._i 

R R 

\Gr\ = Y.\9Gr\ <Y,'^"ia + byiogn<{2{a + b))^^Hogn^O{n^'^). □ 

r=l r=l 

Proof of Lemma \5.5[ For the first claim, fix u,f e dGr- For any w €Vr, the 
probability that {u,w) and {v,w) both appear is 0{n~^). Now, \Vr\ < n and 
Lemma 15.41 implies that = 0{v}l^). Hence the result follows from a 

union bound over all triples u,v,w. 

For the second part, the probability of having an edge between any 
particular u,v € dGr is 0(n~^). Lemma 15.41 implies that |(9Grp = 0(n^/^) 
and so the result follows from a union bound over all pairs u,v. □ 

The final ingredient we need is a bound on the total variation distance 
between binomial and Poisson random variables. 



Lemma 5.6. // m and n are positive integers then 

Binom (m^)- Pois(c) . max{l, |m - n|} . 
\ n' TV \ n I 

Proof. A classical result of Hodges and Le Cam [15] shows that 



Bmom m, — I - Fois | < — ^ = C(n ). 

\ n' ^ n 'Wtv 

With the triangle inequality in mind, we need only show that Pois(cm/n) 
is close to Pois(c). This follows from a direct computation: if A < then 
II Pois(A) -Pois(^)||^^ is just 

'e'^^^ -e'^\^\ _^ /i'^ \^^-\^\ 



y ^-^^ — ^—^ < le-^ - e-^\ y^ + e~^y 

feo fc>o k>o 
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Now the first term is ^ - 1 and we can bound (j,^ - < k{iJ, - \)tjf' ^ by 
the mean value theorem. Thus, 

II Pois(A) - Pois(^)||^^ < e^"^ - 1 + e^-^(^ - A) = 0{^^ - A). 

The claim follows from setting /i = c and A = □ 

Finally, we are ready to prove Proposition 15. 2[ 

Proof of Proposition \5.Si Let Q be the event that | < n'^^^. Clearly, 

¥(Q) 1 exponentially fast. 

Fix r and suppose that Cr-i and Q hold, and that {Tr,Tr) - {Gr-,(Tr)- 
Then for each u e dGr, is distributed as Binom(|V7"|, a/n). Now, 

^ + n^'^ > > Id > - \Gr-i\ > ^ - n^'^ - 0{n^'^) 

and so Lemma [5T6l implies that we can couple with such that F{X^ ^ 
Y=) = 0{n-^l^) (and similarly for X* and ¥*). Since \dGr-i\ = 0{n^l^) by 
Lemma 15. 4|, we can find a coupling such that with probability at least 1 - 
0{nr^l^), X^ - and X* = Y* for every u e dGj-i- Moreover, Lemmas 15. 41 
and 15.51 imply , Br and Gr hold simultaneously with probability at least 
l-n^^"^(^/^^ -0(n~^/^). Putting these all together, we see that the hypothesis 
of Lemma 15.31 holds with probability at least 1 - 0(n~^/^). Thus, 

P((G,+i,CT,+i) = iTr+l,Tr+l),Cr\iGr,C7r) = (T„r,),a_i) > 1 - 0(^-1/^). 

But P(Co) = 1 and we can certainly couple (Gi,cti) with (ri,ri). Therefore, 
with a union bound over r = we see that {Gr^ur) - {Tr,tpi) 

a.a.s. □ 



5.2 Non-reconstruction 

We have shown that a neighborhood in G looks like a Galton- Watson tree 
with a Markov process on it. In this section, we will apply this fact to 
prove Theorem 12.11 In the statement of Theorem 12.11 we claimed that 
E((Tp|G,fT„) 0, but this is clearly equivalent to Var((Tp|G, Ui,) 1. This 
latter statement is the one that we will prove, because the conditional vari- 
ance has a nice monotonicity property. 

The idea behind the proof is to condition on the labels of which can 
only make reconstruction easier. Then we can remove the conditioning on 
cr„, because cfqgj^ gives much more information anyway. Since Theorem 15.11 
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and Proposition 15.21 imply that cannot be reconstructed from cfqgji-, we 
conclude that it cannot be reconstructed from either. 

The first step is to prove that, once we have conditioned on (TdGu^ we 
can remove the conditioning on 0"^,. If it|G were distributed according to a 
Markov random field, this would be trivial because conditioning on CTdGn 
would turn and Op independent. For our model, unfortunately, there are 
weak long-range interactions. However, these interactions are sufficiently 
weak that we can get an asymptotic independence result for separated sets 
as long as one of takes up most of the graph. 

In what follows, we say that X - o{a{n)) a.a.s. if for every e > 0, Pr(|X| > 
ea{n)) as n -> oo, and we say that X - 0{a{n)) a.a.s. if 

limsuplimsupPr(|X| > Ka{n)) - 0. 

Lemma 5.7. Let A = A{G),B = B{G),C = C(G) c V be a (random) 
partition of V such that B separates A and C in G. If \Au B\- o(\/n) for 
a.a.e. G 

F{aAWBuC,G) = (1 + o{l))¥{aAWB,G) 
for a.a.e. G and a. 

Note that Lemma 15.71 is only true for a.a.e. a. In particular, the lemma 
does not hold for a that are very unbalanced (eg. a - +^). 

Proof. As in the analogous proof for a Markov random field, we factorize 
P(G, o") into parts depending on A, B and C. We then show that the part 
which measures the interaction between A and G is negligible. The rest of 
the proof is then quite similar to the Markov random fields case. 
Define 



''^ if {u,v) e E{G) and - 

^ if {u,v) e E{G) and fT„ * 

1 - f if (n, v) i E{G) and o-„ = ct„ 

1-^ if (n,?;) ^ £;(G) and o-„ * o-„. 



For arbitrary subsets Ui,U2 c V, define 

Qui,U2 ^ Qui,U2{G,a) ^ Yl i'uv{G,a). 

(If Ui and U2 overlap, the product ranges over all unordered pairs {u,v) 
with u i= v; that is, if {u,v) is in the product then {v,u) is not.) Then 

rF{G, a) = F{G\a) = QauB,AubQbuC,cQa,c- (3) 
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First, we will show that Qa,c is essentially independent of a. Take a de- 
terministic sequence Un with an\\fn oo but a„|A| = o(n) a.a.s. Define 
■sa(o-) = E^eA^-t) and sc(ct) = Y^v^c^v and let 

17 = {r e {±}^ : |sc(r)| < a„,} 

r^c; = il{/((T) = {r 6 {±}^ -.Tu^du and |sc(t)| < q„}. 

By the definition of a„, if r e then |sA(T)sc(r)| < |A|a„ = o(n) a.a.s. 
Thus, T € Q implies 

QA,ciG,T)^ n ^uv{G,t) 

ueA,veC 

= 1--) 1--) 

. xx/ a\\M\c\l2, h\\A\\c\l2 
= (l + o(l))l-- 1-- a.a.s. (4) 

\ n' ^ n' 

where we have used the fact that u & A, v & C implies that (u, v) ^ E{G), and 

thus ijjuv is either 1 - ^ or 1 - Moreover, 1 - ^ appears once for every pair 

e ^ X C where = t„. The number of such pairs is |^+||C+| + |A_||C_| 

where - {u & A: Tu - +} (and similarly for C+, etc.); it's easy to check, 

then, that 2(|A+||C+| + |A_||C_|) = |A||C|+sasC) which explains the exponents 

in g}. 

Note that the right hand side of dH depends on G (through A{G) and 
C{G)) but not on r. Writing 2-'^K{G) for the right hand side of gD, ^ 
implies that if r e $7 then 

P(G, r) = (1 + o{1))K{G)QauB,a^b{G, t)QbuC,c{G, t) (5) 

for a.a.e. G. Moreover, an/\/n -> oo implies that o" e 17 for a.a.e. a, and so 
for any U^U{G), F{au,G) = {1 + o{l))F{au,(T € n,G) a.a.s; therefore, 

F{au,G) = il + o{l))F{au,a€n,G) 
.(l + o(l)) ^ P(r,G) 

reQ[7((T) 

= (l + o(l))K(G) Qaub,aMG,t)Qbuc,c{G,t) (6) 

Tent7(o-) 

for a.a.e. G and u. (Note that the o(l) term can be taken to depend only on 
G, so there is no problem in pulling it out of the sum.) Applying Q twice, 

KC7AUB,G) 



¥iaA\<JB,G) 



n^B,G) 

(1 I q(^i^^ '^''^^aljbQa'~'B,a<jb{G,t)Qbuc,c{G,t) 

Ere^s QAuB,AuBiG, T )QBuC,ciG, T ) 
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Note that Qui,U2{t) depends on r only through r(/^u(/2- particular, in the 
numerator of ([7]), QauB,Aub{G,t) doesn't depend on r since we only sum 
over T with tauB - (^AuB- Hence, the right hand side of ^ is just 

(l + o(l))-^ rf^ r, (8) 

y EreHBuC QAuB,AuB{G,T)j\^ZrenAuB QbuC,c{G,t)^ 

where we could factorize the denominator because with tb fixed, QauB,AuB 
depends only on ta, while QbuC,c depends only on tc- Cancelling the 
common terms, then multiplying top and bottom by QBuC,c{G,a), we have 

®. (1 + 0(1)) Q^-B,AuB{G,a) 



= (l + o(l)) 
= (l + o(l)) 



EreS^suC QauB,Aub{G, T ) 

QauB,Aub{G, a )QbuC,c{G, a ) 

Eref^suC QauB,Aub{G, T )QbuC,c{G, T ) 

P(G,(t) 



nG,aBuc) 
= {l+o{l))¥{aA\crBuC,G) a.a.s. 

where the penultimate line used ([6]) for the denominator and ^ (plus the 
fact that (T e a.a.s.) for the numerator. On the other hand, recall from d?]) 
that dlD = (1 + o(l))P(o-AkB,G') a.a.s. 

□ 

Proof of Theorem \2. li By the monotonicity of conditional variances, 
Vav{ap\G,ay,aQGR) < Var(o-p|G, fT^,). 

Since |G_r| = o(v^) a.a.s. and v ^ Gr a.a.s, it follows from Lemma ISTl 
that ay and dp are a.a.s. conditionally independent given cfqgr and G. Thus, 
Var (cTp I G, cr^, (TaGfl) ^ Var((jp|G, craGfl)- Now Proposition 15.21 implies that 
I Var(crp|G, (TaG^) - Var(rp|r, rgrn)! ^ 0, but Theorem 15. II savs that 

Var (rp I T, TQTjj) 1 a.a.s. 
and so Var((7p|G, (Tgc^) -> 1 a.a.s. also. □ 

6 Parameter Estimation 

We now prove Proposition! 
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Proof. We first show that Theorem 13.11 gives us an estimator for a and b 
that is consistent when (a - b)^ > 2(a + b). First of all, we can estimate 
d := (a + b)/2 consistently by simply counting the number of edges. Thus, if 
we can estimate f ■- {a - b)/2 consistently then we do the same for a and b. 
Our estimator for / is 

f^{2kXk-d')'/\ 

where d is some estimator with d -* d P-a.a.s. and k - k{n) increases to 
infinity slowly enough so that k{n) - o(log"'^/^ n) and d^-d^ P-a.a.s. Take 

yi±F < p < = /; by Chebyshev's inequality, 2kXk - d^ e [f^ - . f + P^^ 
P-a.a.s. Since k = k{n) ^ oo, p'' ^ o{f). Thus, 2kXk - d^ ^ {I + o{l))f 
P-a.a.s. Since dl" - d^ ^ 2kXk - d!" = (1 + o{l))f^ P-a.a.s. and so / is a 
consistent estimator for /. 

We next apply the second half of Theorem l2.2l to show that no estimator 
can be consistent when (a-b)^ < 2{a+b). In fact, if d and b are estimators for 
a and b which converge in probability, then their limit when (a-b)^ < 2{a+b) 
depends only on a + b. To see this, let a,/3 be another choice of parameters 
with (a - < 2(a + /3) and a + (3 - a + b; Let Q„ = Qn{a,f3); take a* to be 
the in-probability limit of a under P„ and a* to be its limit under Q„. For 
an arbitrary e > 0, let be the event \d - a*\ > e; thus, P„(A„) 0. By 
Theorem l2.21 P^(A„) also. Since a + f3 - a + b, we can apply Theorem l2.2l 
to Q„, implying that Qn(^n) and so a* - a* . That is, a converges to 
the same limit under Q„ and P„. □ 
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